Skip to content

[VL] Add concurrency control to GpuLock#12090

Open
marin-ma wants to merge 2 commits into
apache:mainfrom
marin-ma:support-gpu-concurrency
Open

[VL] Add concurrency control to GpuLock#12090
marin-ma wants to merge 2 commits into
apache:mainfrom
marin-ma:support-gpu-concurrency

Conversation

@marin-ma
Copy link
Copy Markdown
Contributor

Currently, GPULock allows only one Spark task to acquire GPU resources at a time, which effectively makes GPU offloading run in single-threaded mode within a Spark executor. However, when testing the TPCDS queries, we observed that GPU utilization and memory usage can be very low (< 20%), meaning GPU resources are underutilized.

To improve GPU utilization, this PR adds support for running multiple GPU tasks concurrently through the configuration spark.gluten.sql.columnar.backend.velox.cudf.concurrentGpuTasks. The default value is 1.

@github-actions
Copy link
Copy Markdown

Run Gluten Clickhouse CI on x86

@github-actions github-actions Bot added CORE works for Gluten Core VELOX DOCS labels May 13, 2026
@github-actions
Copy link
Copy Markdown

Run Gluten Clickhouse CI on x86

1 similar comment
@github-actions
Copy link
Copy Markdown

Run Gluten Clickhouse CI on x86

@marin-ma marin-ma force-pushed the support-gpu-concurrency branch from 795e48f to a3d9726 Compare May 27, 2026 09:59
@github-actions github-actions Bot removed the CORE works for Gluten Core label May 27, 2026
@marin-ma marin-ma force-pushed the support-gpu-concurrency branch from bd9eda7 to b851138 Compare May 27, 2026 14:13
@github-actions github-actions Bot added the INFRA label May 27, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant